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THE SHATTERING DIMENSION OF SETS OF LINEAR 
FUNCTIONALS 
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We evaluate the shattering dimension of various classes of linear 
functionals on various symmetric convex sets. The proofs here relay 
mostly on methods from the local theory of normed spaces and in- 
clude volume estimates, factorization techniques and tail estimates of 
norms, viewed as random variables on Euclidean spheres. The esti- 
mates of shattering dimensions can be applied to obtain error bounds 
for certain classes of functions, a fact which was the original moti- 
vation of this study. Although this can probably be done in a more 
traditional manner, we also use the approach presented here to deter- 
mine whether several classes of linear functionals satisfy the uniform 
law of large numbers and the uniform central limit theorem. 

1. Introduction. Combinatorial dimensions, such as the Vapnik-Chervo- 
nenkis dimension, and the shattering dimension, are parameters which mea- 
sure the richness of a given class of functions. The Vapnik-Chervonenkis 
dimension (VC dimension) of a class of {0, l}-valued functions is the largest 
dimension of a combinatorial cube that can be found in a coordinate pro- 
jection of the class, that is, in a restriction of the class to a finite subset 
of the domain. In this article we focus on a real valued analog of the VC 
dimension, called the shattering dimension; it is a scale sensitive parameter 
that measures the largest dimension of a "cube" of a given side length that 
can be found in a coordinate projection of the class. 

Definition 1.1. For every e > 0, a set a = {xi,...,x n } C $7 is said 
to be e-shattered by a set F of functions on f2 if there is some function 
s : cr — > R, such that for every I C {1, . . . , n}, there is some fj £ F for which 
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fi{%i) > s(xi) + e if i € J, and fi(xi) < s(xi) — e if 2 ^ /. The shattering 
dimension of -F is the function 

VC(e,F, fi) = sup{|o-|o- Cfl, a is e-shattered by F}. 

fi is called the shattering function of the set / and the set {s(xi)\xi £ a} is 
called a witness to the e-shattering. In cases where the underlying space is 
clear we denote the shattering dimension by VC(e,F). 

In this article we evaluate the shattering dimension of various classes of 
linear functionals on various symmetric convex sets. Before describing the 
actual results obtained, we would like to describe the way one applies such 
estimates to obtain results concerning the uniform law of large numbers 
and the uniform central limit theorem (CLT), as well as error bounds in 
statistical learning theory. 

Combinatorial dimensions have been frequently used in the theory of em- 
pirical processes, mostly in the context of the uniform law of large numbers 
and the uniform CLT. Recall the definition of the uniform law of large num- 
bers, also known as the uniform Glivenko-Cantelli condition. 

Definition 1.2. Let F be a class of functions. We say that F is a 
uniform Glivenko Cantelli class (uGC class) if for every e > 0, 




where {Xi) c ^ l are independent random variables distributed according to //. 

Let us remark that in this article we ignore the question of measurability, 
since only mild assumptions on the class, such as admissibility, are required 
to resolve this issue (see [6] for further details). Moreover, in all the cases 
we explore, it suffices to consider the supremum over a countable dense set, 
and, thus, the measurability issue does not arise. 

Vapnik and Chervonenkis proved that (under mild measurability assump- 
tions) a class of binary-value functions is a uGC class if and only if it has 
a finite VC dimension [23], and this result was extended in [1] to the real- 
valued case, where it was shown that a class of uniformly bounded functions 
is a uGC class if and only if VC(e,F) is finite for every e > (see also [7] 
for a related earlier characterization of uGC classes of functions). 

The shattering dimension can be used to obtain the tail bounds needed 
in (1.1), using the following line of argumentation. The starting point is a 
version of Talagrand's inequality (originally proved in [20]) due to Bousquet. 
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Theorem 1.3 ([5]). Let F be a class of functions defined on a probability 
space (£l,fj,) such that svpf^p ||/||oo < 1. Let (Xj)" =1 be independent random 



varia 



bles distributed according to \i, put a > supj eF Var[/(Xi)] and set Z = 
SU P/GE I T2=l(f( X i) - • T hen, for every x > 0, 

(1.2) PT{Z>EZ + x}<exp(^-vh(^yj, 

where v = na 2 + 2EZ and h{x) = (1 + x) log(l + x) — x. Moreover, for every 
x>0, 

(1.3) Pr|z>EZ + V2^+|| <e~ x . 

In order to apply this result and obtain uniform deviation estimates one 
needs to bound MZ. By symmetrization, 



(1.4) E M Z<2E^ xe sup 

f€F 



where (£i)f =1 are independent Rademacher random variables (i.e., take the 
values ±1 with probability 1/2 each). It turns out that the Rademacher 
averages on the right-hand side of (1.4) can be estimated in terms of the 
empirical covering numbers. 

If (Y, d) is a metric space and F CY, then for every e > 0, N(e, F, d) 
denotes the minimal number of open balls (with respect to the metric d) 
needed to cover F. 

Definition 1.4. For every class F let the empirical covering numbers 

be 

N(e, F, n) = sup N(e, F, L 2 (Mn)), 

fin 

where the supremum is taken with respect to all empirical measures n~ l J27=i 
supported on n points, log N(e, F) = sup„ log N(e, F, n) is called the uniform 
L2 entropy of F. 

The following result shows that the uniform entropy can be bounded via 
the combinatorial parameters. 

Theorem 1.5 ([16]). There are absolute constants K and c such that 
for any class F which consists of functions bounded by 1 and every < e < 1, 

>2\K-VC(ce,F) 
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Combining Theorem 1.5 with a chaining argument, one can bound the 
Rademacher averages of (1.4) and, thus, EZ and obtain the necessary devi- 
ation estimates. In all the examples presented in the sequel we will establish 
upper bounds on the shattering dimension which are polynomial in 1/e, and 
in that case, the following holds. 

Theorem 1.6 ([15]). Let F be a class of functions bounded by 1, and 
set Z to be as in Theorem 1.3. Assume that there are 7 > 1 and <p < 00 
such that VC(e,F) < -yE~ p . Then 

(Vn, 

EZ<C p7 1/2 I y^log^n, 

[ n i-i/Pl og i/p n) 

where C p are constants which depend only on p. 

We now turn to the description of the connection between bounds on 
the shattering dimension and error bounds used in the analysis of regression 
problems in nonparametric statistics and, more recently, in Learning Theory. 
In both applications, combinatorial parameters have played an important 
role. In the context of Learning Theory, they were used to estimate the size 
of a random sample needed to construct an almost optimal approximation 
of an unknown target function by an element in a fixed class of functions, 
where the given data are a sample (Xi)f =1 and the values of the target on 
the sample [2, 15]. Such an error bound which is based on the shattering 
dimension is presented in the next theorem, which was adapted from [4]. 

Theorem 1.7. Let (f2,/x) be a probability space, let F be a class of 
measurable functions on ft with ranges in [—1,1] and assume that there is 
a constant B > 1 such that for every f G F, E^/ 2 < B¥, fL f . If (Xi)f =1 are 
independent random variables distributed according to \x, then for any x > 0, 
there is a set of probability larger than 1 — 2e~ x , on which for any f G F, 

E M /<-E/(X i ) + c(^= + ^), 
n ~^ V v n n / 

where C is an absolute constant and 

(1.5) I = £ ^VC(e, F, {X\ , X n }) logQ) de. 

Let us mention that it is possible to obtain error bounds even in some 
cases when I = 00 [14], and that in [4], error bounds with faster rates of 
convergence than were established in the same setup. 



i/0<p<2, 
ifp = 2, 
ifp>2, 
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The analysis of the shattering dimension of classes of linear functionals 
we present is based on methods from the local theory of normed spaces. We 
show that for such classes the shattering dimension is determined by the 
geometry of the class and the domain, which is expressed by the ability to 
factor a certain operator through First, we investigate in Section 3 the 
case when Q is the unit ball of some Banach space and F is the dual unit 
ball. We show that if X is infinite dimensional and Bx is the unit ball of X , 
the shattering dimension VC(e, Bx*,Bx) is determined by the Rademacher 
type of X . In Section 4, which contain the main new results of this article, 
we use a volumetric argument and establish estimates on the shattering 
dimension when both the class and the domain are finite-dimensional convex 
and symmetric sets. We then compute the shattering dimension of the unit 
ball in 1™ when considered as functions on the unit ball of 1™ , 1 < p, q < oo, 
and show that in many cases the volumetric approach yields sharp bounds. 
For example, we prove in Theorem 4.12 that for 1 < p < q < oo and for 
F = B q >, the unit ball of l q < (q' is the conjugate index to q), and 17 = B p , 
VC(e,F,Q) is given, up to constants depending only on p and q, by the 
following expressions: 



Section 5 is devoted to computation of the shattering dimension of the image 
of the unit ball of under a linear transformation. 

The applications of the estimates of the shattering dimensions to the 
determination of whether some classes of functionals satisfy the uniform law 
of large numbers or the uniform CLT are scattered through Sections 3 and 4. 
In Section 3 we give, among other things, a new proof for a result from [7] 
giving a necessary and sufficient condition for the unit ball Bx* of a dual 
Banach space to be uGC class on Bx- In Section 4 our results are used to 
investigate the following problem: consider the unit ball of £ q , denoted by 
B q , as functions on the unit ball of £ p . Does this class of functions satisfy 
the uniform CLT on this domain? In general, one can show that for any 
infinite-dimensional Banach space X , F = Bx* , does not satisfy the uniform 
CLT on the domain f2 = Bx- Although this can probably be deduced from 
earlier contributions, we show, as an application of the methods presented 
here, that whenever p < q, F = B q > satisfies the uniform CLT on the domain 



2. Preliminaries. Throughout all absolute constants are denoted by c, C 
or K. Their values may change from line to line or even within the same line. 
Cm, Cm denote constants which depend only on the parameter (p (which is 
usually a real number p or a couple of real numbers p, q), and a b means 




if l<p<2, 
if 2 <p< oo. 



n = Bp. 
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that Cpb <a< C^b. If the constants are absolute, we use the notation a ~ b. 
Given a real Banach space X, let Bx or B{X) be the unit ball of X. The 
dual of X, denoted by X* , consists of all the bounded linear functionals 
on X, endowed with the norm ||x*|| =supi| 3 ,i| =1 For every integer n, 

we fix the Euclidean structure (•, •) on W 1 with an orthonormal basis denoted 
by {ei)U- 

A set K is called symmetric if the fact that x S K implies that — x 6 K. 
The symmetric convex hull of K, denoted by absconv(i^), is the convex hull 
of K U —K. 

If K C W 1 is bounded, convex and symmetric with a nonempty interior, 
then K is a unit ball of a norm denoted by || • \\k- It is possible to show that 
the polar of K, defined by 

K° = |i£ 



jxeR n sup(fc,x> < 1 



is the unit ball of the dual space of (]R n , || • \\k)- In the sequel we shall abuse 
notation and denote by K the normed space whose unit ball is K. From 
here on, a ball will be a bounded, convex and symmetric subset of M n , with 
a nonempty interior. 

If 1 < p < oo, let 11 be R n endowed with the norm || £? =1 o^Hp = (£2=1 N P ) 1/p - 
i^o is K n endowed with the norm || J27=i a i e i\\oo = su Pi \ a i\- Bp is the unit 
ball of £p, and for every 1 < p < oo, (Bp)° = B™,, where 1/p + l/p' = 1. In 
this case, p' is called the conjugate index of p. 



2.1. Volume estimates. As stated above, we can identify with W 1 . 
Hence, P\ is endowed with the n-dimensional Lebesgue measure, denoted 
by | • | . Let GL n be the set of invertible operators T : M. n — > M. n , and note that 
for every measurable set A C M. n and every T S GL n , \TA\ = \ det(T)| \A\ . We 
say that a set A C W 1 is an ellipsoid if there is some T € GL n , such that 
A = TB%. 

It will be useful to determine the volume of the balls BL 1 and the volume 
of their sections. First, let us mention the following well-known fact. 

Theorem 2.1 ([19]). There are absolute constants C and c such that 
for every integer n and every 1 <p< oo, 

cn~ 1/p <\B p l \ 1/n <Cn~ 1/p . 

Unlike the clear structure of sections of B^ , the geometry of sections of B™ 
is far less obvious. The following result, due to Meyer and Pajor [17], bounds 
the volume of ^-dimensional sections of B™. 
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Theorem 2.2. For every k- dimensional subspace E C M n and every 1 < 
p <q< oo, 

\B2f\E\ \B?r\E\ 
< 



\B k \ ~ \B k \ 

I -"pi \ J -'q\ 

By selecting q = 2, it follows that for 1 < p < 2, the volume of any fc-dimensional 
section of B™ is smaller than the volume of B k Similarly, by taking p = 2, 
the volume of any /c-dimensional section of B™ for 2 < q < oo is larger than 
the volume of B k . 

Remark 2.3. A similar result holds in the infinite-dimensional case. In 
particular, it follows that for any 1 < p < q < oo and for any n-dimensional 
subspace E, 

An important fact about the volume of balls are the Santalo and inverse 
Santalo inequalities. 

Theorem 2.4. There is an absolute constant c such that for every in- 
teger n and every ball K C M. n , 

f\K\\K°\\ 1 / n 

The upper bound was established by Santalo, while the lower bound is 
due to Bourgain and Milman. The proof of both results can be found in [19]. 

One of the tools used in modern convex geometry is the notion of volume 
ratios. The idea is to compare the volume of a given ball with the "best" 
possible volume of an ellipsoid contained in it, since this may be used to 
understand "how close" the norm induced by the ball is to a Euclidean 
structure. 

Definition 2.5. For every ball K C M", the volume ratio of K is 

\K\ \ l l n 



vr(K) = inf 



\TB%\ 



where the infimum is taken with respect to all T S GL n such that TB^ C K. 
The external volume ratio is defined as 



evr(fT) = inf 



TB^\\^ n 



\K\ 

where the infimum is with respect to all T £ GL n such that K C TB^ ■ 
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It is possible to show [19] that both infimums in the definition above are 
uniquely attained. Hence, for every ball K C K" ', there is an ellipsoid of 
maximal volume contained in K and an ellipsoid of minimal volume con- 
taining K. The ellipsoid of maximal volume contained in K is denoted by 
Ex-, and the ellipsoid of minimal volume containing K is denoted by £k- 
Note that for every ball K , £# = Ek° ■ 

It follows from the definitions that if K is an ellipsoid, then vt(K) = evi(K) = 1. 
Moreover, it is known that for every ball K C M n , vx{K ) < \fn. More pre- 
cisely, the volume ratio of i^, which is of the order of y/n, is the worst 
possible. 

Theorem 2.6 ([3]). For every integer n, 

4 



vr(K)<vr(^) 



\ B 2 \ 1/r 



Another result we require is an estimate on the volume ratios of projec- 
tions of £ p . 

Theorem 2.7 ([12]). For every integer n, 

, PR , fl, 1<P<2, 
sup \t{PeB p ) ~ < . . 

EcM n p [n 1 / 2 " 1 ^, 2<p<oo, 

where the supremum is taken with respect to all the projections onto n di- 
mensional subspaces of £ p . 

A different notion of volume ratios is the cubic ratios which was intro- 
duced by Ball [3]. For every ball K C M n , let 

\TB n I \ 1 / n 



cr(i^) = inf 



TeGL n ,KcTB^\ \K\ 

Lemma 2.8 ([3]). There are absolute constants c and C such that for 
every integer n and every ball K C W 1 , 

cyfTu < vt(K)ct(K) < Cyjn. 

Finally, we can define the volume numbers of an operator. We follow the 
definition used by Gordon and Junge [11, 12]. 

Definition 2.9. Given Banach spaces X and Y, an operator T : X — > Y 
and an integer n, let the nth volume number of T be 



(\T(B x nE)\\V n 



^ (T)=SUP U \ByHF 



EcX, T(E) C F CY, dim E = dimF = n 
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Note that if T is of rank smaller than n, v n (T) = 0. Also, it is clear that 
the volume numbers are submultiplicative, that is, v n (TiT%) < v n (Ti)v n (T2) . 
If T is an operator between Hilbert spaces, then the volume numbers may 
be calculated using the eigenvalues (Aj) of y/T*T (which are arranged in a 
nonincreasing order). In that case, for every integer n, v n (T) = (JliLi ^i) 1 ^" '• 

Another example in which the volume numbers may be estimated is for 
the formal identity operator id:£" 1 — > l™. By Theorem 2.2 it is evident that 
for every n <m and any 1 < p < q < oo, 

(2.2) vJidem^im) = sup J-5 ^< 4n <C pq n 1/g ~ 1/p 

P q dimi=n l^n^l 1 /" " |^n|l/n - P.9 

and clearly also 

(2.3) v n (idem^(m) > 



r>n 1 1/n 
P 1 

-"g I 



In general, if p > q, then 

(2.4) U n (M^m_+£m) < ||id||^m^m = m 1/<? ~ 1/:P , 

and this estimate is optimal, at least in cases where n divides m. To see 
this, let k = m/n and for j = l,...,n, let vj = J2i=i e j+k(i-i)- Note that 
for each r, span{ui, . . . ,v n } n = .E n £?™ has volume (m/n) 1 / 2 ~ 1 / r |l?™| . 
Thus, 

|£ m n£| 1/n / m \ l/g-l/p |5 n | 1/n 



\n J LB" 1 /" 



iB^n.Bl 1 /™ 

proving that the bound on the volume numbers is tight. 

2.2. T/ie uniform CLT. The fact that the shattering dimension can be 
used to bound the uniform entropy will enable us to show that some classes 
of functionals satisfy the uniform CLT. Recall that a sequence of measures v n 
converges to v in law in (F) if for every bounded and continuous function 
H:£oo(F) ->R, WH{v n ) -+R*H(v), where E* denotes the outer expecta- 
tion. 



Definition 2.10 [6]. Let F c B(Loo(Q)), set P to be a probability 
measure on Q, and assume Gp to be a Gaussian process indexed by F, 
which has mean and covariance 

EGp(f)G P (g) = J fgdP- J fdP J gdP 

F is called a universal Donsker class if for any probability measure P, the law 
Gp is tight in £oo(F) and = n 1//2 (P n — P) € £oo(F) converges in law to Gp 
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in £oo(F)i where P n is a random empirical measure selected according to P, 
that is, P n = ±J2t =1 S Xi , where (JQJL l are independent random variables 
distributed according to P. 

Stronger than the universal Donsker property is the uniform Donsker 
property, which is the uniform version of the CLT. For such classes, z/„ con- 
verges to Gp uniformly in P in some sense (see [6, 22] for more details). The 
following result of Gine and Zinn [8] is a relatively simple characterization 
of uniform Donsker classes. 



For every probability measure P on Q, let pp(f,g) = Ep(f — g) 2 — (Ep(/ — 
g)) 2 , and for every <5 > 0, set F s = {f-g\f,g€ F, pp(f,g) < 5}. 



Theorem 2.11 ([8]). F is a uniform Donsker class if and only if the 
following holds: for every probability measure P on £1, Gp has a version with 
bounded, pp-uniformly continuous sample paths, and for these versions, 



The main tool in the analysis of uniform Donsker classes is the Koltchinskii- 
Pollard entropy integral. 

Theorem 2.12 ([8]). // F C B^L^n)) satisfies that 



then it is a uniform Donsker class. 

3. Shattering by Bx* • The goal of this section is to bound the shattering 
dimension of the dual unit ball of a given Banach space. To that end, we 
present the geometric interpretation of the shattering dimension when Q C 
X and F = B x * ■ 

Lemma 3.1. Let X be a Banach space. Assume that {x±,...,x n } is 
e-shattered by Bx* and set E = span{xi, . . . ,x n }. If A is the symmetric 
convex hull of {x±, . . . , x n }, then e{Bx CiE) C A. 

Proof. Let {x±, . . . ,x n } be e-shattered by Bx* and let {s±, . . . ,s n } to 
be a witness to the shattering. Put (cij)" =1 C R, set / = {i\ai > 0} and let x} 
be the functional shattering the set /. For every such I and every i £ I, 



supEsup |Gp(/)| < oo 
p f&F 



lim supE sup \Gp(h)\ = 0. 

P h€F s 




x}(xi) - x} c (xi) > Si + e - [si - e) = 2s, 



and if i ^ I, 



x}(xi) - x*j c (xi) < Si - e - (si + e) = -2e. 
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Thus, 



i=i 



sup 

x*£B x t 



' n \ 
^=1 / 



> 



2 SUp 

x* ,x*aB x * 



(*)• 



Selecting x* = x\ and x* = x} 
(*)>3 



J ( OiXj + OjXj J - xf c I ^ CLiXi + ^ OjXi J 

Kiel iei c ) Kiel iei c ) 



^2ai(x*j(xi) -x*jc(xi)) + ^2(-ai)(x*jc(xi) -x* I (x i )) 



8=1 



Since every point x on the boundary of A is given by x = J27=i a i x i, where 
Ya=i \ a i\ = 1) then \\x\\ = \ J2i=i a i x i\ > £ ) which proves our claim. □ 

Corollary 3.2. The set {x±, . . . ,x n } C -By * s e-shattered by Bx* if 
and only if (xj)f =1 are linearly independent and e-dominate the l\ unit- 
vector basis; that is, for every a±, . . . ,a n G R, ^Z^iLiI *! — II Ya=i a i x i\\ ■ 

Proof. Let E = spanjxi, . . . ,x n } for some linearly independent ele- 
ments of Bx , define T : — > £2 by Tei = Xj and set A to be the sym- 
metric convex hull of {x%, . . . ,x n }. For every / C {1, . . . ,n}, there is some 
v G B^ such that (v,ei) = 1 if i G I and (v,ej) = —1 otherwise. Note that 
(v,ei) = {v,T~ l Te-) = (T~ l *v,Tei) and that A° = (TB?)° = T" 1 ^™ , im- 
plying that T~ l *v G A°. If {xi, . . . ,x n } e-dominate the Z\ unit-vector ba- 
sis, then e(B x nfi)c4 and A° C e- 1 (B x n E)° = e^PeBx*, where P E 
is the orthogonal projection onto E. Thus, there is some x* € Bx* such 
that T -1 v = tPEX* for some < t < e -1 . Hence, (x*,Xi) = (x*,Tei) = 
(Pex* ,Tei) =t~ 1 (T~ 1 v,Tei) >e if i € I. By a similar argument, (x*,Tej) < 
—e if j /, which shows that {xi, . . . , x n } is e-shattered by Bx* ■ 

Conversely, if {xi, . . . , x n } C Bx is e-shattered, then for every a±, . . . , a n G 



i=l 



n 
i=l 



Hence, (xj)™ =1 are independent and e-dominate the t\ unit-vector basis. □ 
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This result enables us to estimate the shattering dimension of the dual 
unit ball of an infinite-dimensional Banach space X when considered as 
a class of functions on Bx- It turns out that the shattering dimension is 
determined by the type of X. 

Definition 3.3. A Banach space X has type p if there is some constant 
C such that for every integer n and every x±, . . . , x n 6X, 



(3.1) 



E 



E 

2=1 



EjXn 



< 



qE 



i/p 



w = l 



where (e«)f =1 are independent Rademacher random variables. The smallest 
constant for which (3.1) holds is called the type p constant of X and is 
denoted by T P (X). 

The basic facts concerning the concept of type may be found, for example, 
in [18]. Clearly, for every Banach space (3.1) holds in the case p= 1 with 
Ti{X) = 1. If p* = sup{p|X has type p}, then 1 < p* < 2, and if p* = 1, then 
X is said to have a trivial type. 

Recall that the distance between two isomorphic Banach spaces X and 
Y is defined as d(X, Y) = inf ||T|| • ||T _1 ||, where the infimum is taken with 
respect to all isomorphisms between X and Y. It is easy to see that if X , Y 
and Z are isomorphic, then d(X, Z) < d(X, Y) ■ d(Y, Z). 



Theorem 3.4. Let X be an infinite- dimensional Banach space. Then 
VC(e, Bx* , Bx) is finite for every e > if and only if X has a nontrivial 
type. If X has type p, then 



ip'^l<VC ( ^-.,Sv)4^) 



p/ip-1) 



+ 1. 



The lower bound and a weaker version of the upper one were established 
in [13] . We repeat the proof of the lower bound for the sake of completeness. 



Proof of Theorem 3.4. If {x\ , x n } is e-shattered, then it e-dominates 
the £i unit-vector basis. By selecting = e^, en < \\ Ya=\ £ i x i\\ ■ On the other 
hand, taking the expectation with respect to the Rademacher variables, 



E 



n 

^ £{Xi 
i=l 



<T P (X) E 



i/p 



x 



i\\X 



< T p (X)n 



i/p 



Thus, there is a realization (£i)™ =1 such that || J27=i £ i x i\\ ^ T p {X)n 1 ^ p . Com- 
bining the two inequalities, n < (T P (X) / e) p ^ p ^ 1 \ 
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Conversely, for every A > and every integer n, there is a subspace 
X n cX such that dimX n = n and d((%*,X n ) < 1 + A (see [18]). Recall that 
d(q,i™„) =n 1 ~ 1 /p* ( S ee [21]), hence, d{X n ,q) < (1 + A)n 1 ~ 1 / p * , and, in par- 
ticular, there are Xi,.. . ,x n C Bx such that for every (a,)f =1 C M, 

1 n 

(1 + AK-Vp* ^ 

Therefore, {xi , . . . , x n } is n^~ p *^ p * (1 + A)~ 1 -shattered by and the 

claim follows by taking A — > 0. 

The assertion in the case p* = 1 follows in a similar manner. □ 

The uGC part of the next corollary was first proved in [7] , and the second 
part may also be known to experts; the proof presented below is new, as far 
as we know. 

Corollary 3.5. Let X be an infinite- dimensional Banach space. Then, 
F = Bx* is a uGC class on O = Bx if and only if X has a nontrivial type. 
Also, for any infinite- dimensional X , F is not a uniform Donsker class 
on f2. 

Proof. The fact that the pair is a uGC class if and only if X has a 
nontrivial type follows from Theorem 3.4 and the characterization of uGC 
classes as classes with a finite shattering dimension at every scale e (see [1]). 

As for the second part, in [8], Example 3.3, it was shown that if X = £2, 
then F = Bi is not a uniform Donsker class on 0, = B2. Moreover, an easy 
modification of the proof reveals the following: if there is a constant C such 
that for every integer n there are spaces X n C X of dimension n for which 
d(X n ,£%) < C, then F = Bx* is not a uniform Donsker class on Q = Bx- By 
Dvoretzky's theorem [18], every infinite-dimensional Banach space has such 
subspaces X n (with a constant C arbitrarily close to 1). □ 

Unlike the infinite-dimensional case, in which the growth of VC(e, Bx* , Bx) 
is determined by the type of X , it is not clear whether the same holds for 
finite-dimensional spaces; indeed, the lower bound in Theorem 3.4 is based 
on the fact that X contains spaces which are arbitrarily close to 1™* for every 
integer n, which is only true for infinite-dimensional spaces. 

4. The shattering dimension of finite-dimensional bodies. It turns out 
that some applications require that the set of functionals F is not the dual 
of the domain but some other convex symmetric set; thus, in the finite- 
dimensional context it is natural to investigate the following question. 



n 

E 

i=i 



QjjXj 
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Question 4.1. Let K and L be two convex symmetric bodies in M. d 
and view the elements of L° as functions on K using the fixed inner product 
in R d . What is VC(e,L°,K)l 

We have shown that VC(e, L°,K) = n if and only if n is the largest such 
that there are n points {x\, . . . , x n } C K for which e(LC\E) C absconv(£i, . . . ,x r , 
where E = spanjxi , . . . , x n } . 

The next theorem provides a general upper bound on VC(e, L°,K) based 
on a volumetric argument. The result is presented for finite-dimensional 
bodies but can be easily extended to the infinite-dimensional case. 

Theorem 4.2. There is an absolute constant C such that for every 
two integers n < m and every two balls K,L C M. m the following holds: if 
{xi, x n } C K is e-shattered by L° , then 

r- c „ T , ^AKnEl 1 ^ 

y/n<—vr((KnE) y— -^t-, 

e (Ln^l 1 /™ 

where E = spanjxi ,x n }. 

Proof. Assume that {x±, . . . , x n } C K is e-shattered by L°. By Lemma 3.1, 
e(L f]E) C A C K n E, where A is the symmetric convex hull of {x±, . . . , x n }, 
and, thus, (K n E)° C A°. By Lemma 2.8, 

\A°\ \ 1 / n 

Cv ^ < vt((K n E)°) cr((K n E)°) < vr((K n E)°) ' 



\(KnE)°\ 

< n gnrj^nM) 1/n < £ „ (( A- n fi^l) 1/n , 

where the last inequality follows from the Santalo and inverse Santalo in- 
equalities. 

□ 

Combining this theorem with Remark 2.3 on the ratio \B p n E\/\B q n £J| 
and Theorem 2.7 on the volume ratio of projections of £ p , the following is 
evident: 

Corollary 4.3. For every 1 < p < q < oo there is a constant C PA for 
which the following holds: if {xi, . . . , x n } C B p is e-shattered by B q i , then 



n 1 ^- 1 , ifl<p<2, 
nVv-VP-V 2 , if2<p<oo. 
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In the sequel we will show that this estimate is sharp. Since a similar 
argument is used in the proof of Theorem 4.10, we shall not present the 
proof of the corollary here. 

Let us mention the following observations: first, using Santalo's inequality, 
vt((K n E)°)\K n E\ l l n < ISrheI 1 ^ ■ Therefore, from the volumetric point 
of view, all that matters is the ratio between the volume of the ellipsoid of 
minimal volume containing the section of K spanned by {x\ , . . . , x n } and 
the volume of L n E. 

Second, estimating the shattering dimension is equivalent to understand- 
ing the behavior of its formal inverse, which, for a given linearly inde- 
pendent set {xi, . . . ,x n } C K, is the largest e > such that e(L n E) C 
absconv(xi, . . . ,x n ), where E = spanjxi, . . . ,x n }. Thus, one can take K = 
TBi, where T : Z\ — ► £2 is defined by Te^ = Xi, and the volume of the ellipsoid 
of minimal volume containing TBf is the significant quantity. 

Finally, if (Aj)" =1 are the singular values of the operator T, that is, the 
eigenvalues of y/T*T, then \£ T B?\ 1/n is equivalent to n~ 1/2 (Ui=i\) 1/n - 

4.1. Shattering and factorization through l\. An alternative way to for- 
mulate the problem of estimating the shattering dimension is as a factoriza- 
tion problem. 

Definition 4.4. For every two balls K and L in R m and every integer 
n < m, let 

T n (K,L)=M PHII-BII; 

the infimum is taken with respect to all subspaces of E C M m of dimension n, 
and all operators B : (E, \\ ■ \\lde) — ► V\ , A — > (E, \\ ■ \\kde) such that AB = 
id: Lf] E ^ K n E. 

The following lemma shows that l/T n (K, L) is the formal inverse of the 
shattering dimension. 

Lemma 4.5. For every integer n and any balls K and L, 

(4.1) I =sup{e|3{xi, . . . ,x n } C K, e(L n E) C absconv(xi, . . .,x n )} 
L n {K,L) 

(4.2) =sup{e|VC(e,L°,ir)>n}, 
where E = spanjxi, . . . ,x n }. 

Proof. If the identity admits an optimal factorization id = AB, set 
A' = A/\\A\\£n_ iKnE and observe that the set {A'e±, . . . , A'e n } C K D E sat- 
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isfies that for any a\, 



\A\ 



?y->ftrn£ ■ 1 1 -B | Urns-Key 



t=l 



> 



> 



B ^2 a iAei 



U=i 



i=l 



El 
i=i 



Hence, absconv(A'ei, A'e n ) C A' n E contains (||^4|| ||.B||) -1 (.L n E) and 

- tt; t -, < sup{e|3{z 1 , . . . , x n } C K, e(LnE) C absconv(xi, . . .,x n )}. 
L n {K,L) 

For the reverse inequality, if {x±, . . . ,x n } C K are such that e(L nfi) C 
absconv(xi, . . . , x n ), define T : W l — > M n byTej = Xj. Clearly, ||T||^n^^ n £: < 
1 and 



IT- 1 ! 



1 



Ln£- 



sup ||T = sup ||as|U^y < -. 



xglde * xeLnE " e 

Thus, HTII/n^nB • llr-^Un^y < 1/e and l/r n (tf,L) > e. □ 

Combining Theorem 4.2 and Lemma 4.5, we obtain the following: 

Corollary 4.6. There is an absolute constant c > such that for any 
two integers n<m and any two balls K, L C W™, 

F n (K,L)>c 



n 



v n (id :K -> L) sup B vr(P E K°) 



where dim(E) = n. 

4.2. Factorization constants of I™. The goal of the next section is to 
investigate the shattering dimension of the class of linear functionals F = 
B™ on Q = B™ for 1 < p, q < oo. First, in Theorems 4.9 and 4.10 below 
we present a tight estimate on the factorization constant of id'.i™ — > 
through Then, we use this result to estimate T n (B™,B™) and, thus, 
bound VC(e, B"?, B™); finally, we show that if 1 < p < q < oo, then F = B q > 
is a uniform Donsker class on 0, = B p . 

We begin with two lemmas needed for the proof of Theorem 4.9. 

Lemma 4.7. Let [i be the Haar measure on the n dimensional sphere 
S n . Set K and L to be balls in W 1 and put a to be such that 



H ( x € S 



n-l 



\x\\k > 



1 



a 



< 



1 

2n" 
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n— 1 



x \\ LO> « )<2-(-+ 1 ) 



// e satisfies that 



Proof. Denote by O n the orthogonal group and let Po n be the Haar 
measure on O n . Set U £ O n and define X{ = aUei. Using the standard con- 
nection between Pq„ and \x on S n , 



Po n {xi G K) = n[ x G 5' 



m- 1 



hence, 



Po„(^ € K for all i) > 1 - n/Jt^x £ 5 n_1 
Moreover, 

P 0n (couv(±aUei) D eL) = P „ sup 

\(^)™ =1 6{-l,l}" 

For every vector (eri, . . . , a n ) G { — 1, 1}", 



i ii i\ 1 

a/ 2 



1 ™ 

— y^fi^e, 

a r— i 

i=i 



< r 



i=l 



>- =Po„ 



> 



a 



a— i 



L c 

a 



S\ n 



Thus, 



P Qn (conv(±af/e;) D eL) > 1 - 2 n fi{ x e S 



-in—l 



£\/n 



a \ 1 

\x\\L° > —j= > 77, 

eJn 2 



and there is some orthogonal operator which belongs to both events. The 
operator T = aU satisfies that ||T'||^«_ > _K- n _E: < 1 and \\T~ ||inE->^? < V e > as 
claimed. □ 

Lemma 4.8. There are constants C p for which the following holds: for 
every integer n, 

^{xeS n - 1 \\\x\\^>C p n llp - 1/2 )<2^ n+1 \ 
if 1 < p < 2 and if 2 <p < oo, then 

/i(x€S n - 1 |||x||/» >C p n 1/p_1/2 ) <e~ n2/P . 
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Proof. Denote by M(B™) the median of \\x\\ p on S n ~ l . By Levy's in- 
equality [18], 



fi(x G 5 n-1 |||a;||p > (1 + t)M(B^)) < exp< 



t 2 nM 2 (B™) 

' 2|M||; 



, On. 

2 



Recall that M(B$) ~ p n 1 ^" 1 / 2 (see, e.g., [18]) and that 

It follows that for 1 < p < 2 and C large enough, depending only on p, 

H{x G S n ~ l \ \\x\\tn > Cn 1/p - 1/2 ) < e - c ? c2n < 2~ {n+1 \ 

while for 2 < p < oo and C depending only on p, 

fi(x G S"- 1 !)^!!^ > Cn 1 / 2 ^) < e- nVp . □ 

The above results will play an important role in the proof of the following 
theorem, in which we construct factorizations of id:£™ — > £™ through l\. 

Theorem 4.9. Let K = Bp andL = B^. Then, F n (K,L) satisfies that 
- n l/2+l/p-l/ 9) i/2<p,g<oo, 



T n (K,L)<C Ptq { 



n 1 " 1 /", ifl<p<2, 

n 1 " 1 / 9 , z/l<g<2<p<oo and p' > g, 



Vp, if 1 < g < 2 < p < oo andp' <q. 



Proof. First, assume that 2 < p < oo and 2 < g < oo. By Lemmas 
4.8 and 4.7 it suffices to choose l/a = C p n 1 ^' p ^ 1 / 2 and select s which satisfies 

that G> 1/9 '~ 1/2 = CX /2 ~ V " = that is, ± ~ g Therefore, 

T n {B^B%)<C m n x f 2 + 1 /v- l li. 
Next, if 1 <p< 2, then 

If 2 < p < oo and 1 < q < 2, one has to treat two cases; if 1 < q < p' , then 
using the identity operator as above, T n (Bp , B™) < n 1-1 / 9 . On the other 
hand, if p' < q < 2, then by the first part of our claim, 

r„(fl£, B£) < \\id\u^ q r n (B;,B^) < c^/p. 

Finally, one has to address the situation when p is infinity. If p = q = oo , 
thenr n (i?-,i?™) = (i(^,C)<Cn 1 /2 [ 21 ]. 

For p = oo we first examine the case q = 2. Let £p(C) to be C n endowed 

with the l p norm and set T = (n _1 ' 2 e 27 ™- jfe ' n )?-' fe=1 . It is easy to check that 

||T||^(C)— (C) < n_1/2 an d that ||T||^n (C)-^(C) < ™ 1/2 - For °ur purpose, 



SHATTERING DIMENSION 



19 



£p (C) can be considered as the £™ sum of two-dimensional Euclidean spaces, 
$>) over the reals, and since for any 1 < p < oo, ||«i||^n(c)-^2n • |M||^2n_^n(c) < 

\/2, then r2n(^2 ?1 > ^oo ) — 2- The case where n is odd is easily reduced to the 
even case. 

Finally, for a general q, 

T n (B^B n q ) < \\id\\^ q T n (B^B%) < C\\id\\ e ^ q , 

as claimed. □ 

The next step in our analysis is to show that the bounds in Theorem 4.9 
are tight. The proof uses the notion of r-summing operators. Recall that an 
operator T : X — > Y is r-summing for 1 < r < oo, if there is a C < oo such 
that 

n n 

(4.3) ^||Txi|| r <C"' sup £|ac*(ii)| r 

i=\ x*eB x * i=1 

for all integers n and all x±, . . . , x n € X. The smallest C for which (4.3) holds 
is denoted by 7r r (T) and is called the r-summing norm of T. 

Theorem 4.10. There exist c VA such that if K = Bp and L = B™ , then 
T n (K,L) satisfies that 



T n (K,L) > c Pi g 



n l/2+l/p-l/9 ( if2<p,q<oo, 
n 1 - 1 ^, ifl<p<2. 



Also, there are c Ptq ^ r such that if 1 < q < 2 <p < oo and r > max{p, q'}, then 

T n (K,L) > c p ^ r n l/r . 

Proof. The first two cases follow from the volumetric estimate as in 
Corollary 4.3. Indeed, if {xi, . . . ,x n } C K is e-shattered by L°, then 

f\K\ \ l / n 

for some absolute constant C. Since K = Bp and L = B™, then (li^l/lLl) 1 /™ 
n i/q-i/ P and vr (^°) ^ n Vp-i/2 f or i < p < 2 and ~ 1 for 2 < p < oo. Hence, 

,l/«-l/p-l/2 



if2<p<oo, 
n 1 /*- 1 , ifl<p<2, 



and the lower estimate on T n is evident from Lemma 4.5. 

For l<q<2<p<oo we can get a better estimate than what the vol- 
umetric estimates provide. We first investigate idii™ — Observe that if 
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AB is a factorization of the identity through I™, then B* A* is a factorization 
of id through P^. A theorem of Maurey (see [21], Theorem 21.4(h)) asserts 
that, for every r > q' , B* is r-summing with ir r (B*) < C q ^ r \\B*\\ and, thus, 
by the properties of ir r , ir r (id) < \\A* \\Tt r (B*) < C qjr T n (B q i ,B q ). 

The behavior of the ir r norm of the identity between £ p and other spaces 
was investigated in [9] and [10]. In particular, in the range we are interested 
in, it is proved in [9] that ir r (id:£q — > £™,) > Cg^n 1 ' 1 " . (For the interested 
reader, we found that the best way to understand this is to apply Theorem 1 
there to our setup. This is rather easy, as is the proof of Theorem 1.) 

This settles the case p = q' . Turning to the general case, assume first 
that 2<q'<p<r< oo. For any factorization AB = id q ^ p , id p _> q >AB is a 
factorization of id q ^ q >. Therefore, for any s > q' , 

C q , r n 1/s < \\B\\\\id p ^ q >A\\ < < \\A\\\\B\\n 1/q '- l/p , 

hence, 

\\A\\\\B\\ >Cg ir n 1/P+1/s_W - 
Choosing s such that 1/r = 1/p + 1/s — 1/q 1 gives the result in this case. 
A similar argument may be used to handle the case q' > p. □ 

Next we estimate T n (B p n ,B' q n ) when n < m. Note that the results we 
obtain are not for the full range of p and q. 

Theorem 4.11. For every integers n<m the following holds: 

1. If2<q<p<oo thenT n {B p n ,B™)~ P)q n l / 2 m l /P- l /i. 

2. Ifq<p<2 thenT n (B r p n ,B^)^ pq n 1 - 1 / p m 1 / p ' 1 / cl . 

3. If p<q andl<p<2 then T n (B™ , B r q n ) ~ p , g n 1 " 1 /-? . 

4. Ifp<q and2<p<oo then T n (B™, B™) n i/2+i/p-i/s . 

Proof. In all cases, the lower bound follows from Corollary 4.6 com- 
bined with the estimate on the volume numbers of id p ^ q in (2.2) and (2.4), 
and the volume ratios of quotients of £ p from Theorem 2.7. 

As for the upper bound, the optimal choice in (1) and (2) (at least when 
n divides m) is the section E spanned by 

k 
i=l 

Then B™ n E = {m/n) l / 2 ~ l / p B™ . Clearly, T n (B™, B™) < T n {B™ n E, B™ n 
E) and when 2 < q < p the latter can be approximated using the proba- 
bilistic argument from Lemmas 4.8 and 4.7. Indeed, a straightforward com- 
putation shows that one can take a = m}l 2 ~ l l p and that e needs to sat- 
isfy that m l / 2 - l / q = a/n l ' 2 e = m 1 ' 2 - 1 ^ /n l / 2 e. Thus, 1/e < n l l 2 m l / p ~ l / q , 
which proves the bound is tight. 
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When q < p < 2 one uses the identity operator as the factorizing operator 
between (m/n) 1//2_1//|3 i?™ and {m/n) l / 2 ~ l / p B p to obtain the required result. 

The upper bound in (3) is obtained by taking the canonical section 
spanjei, . . . , e n } and applying Theorem 4.9. □ 

Some of the information one can obtain from these estimates is summa- 
rized in the following: 

Theorem 4.12. Let l<p<q<oo, set F = B q < and SI = B p . Then: 

1. 

' e -ff/(9-i) 9 ifl<p<2, 
£ -l/(l/2+l/p-l/ff) j if2<p<oo. 



VC(e,F,fi) p - 9 . 



2. F is a uniform Donsker class on fi. 

3. There are constants C PtQ such that for any probability measure fi on B p , 
every integer n and every t>0, 



1 

E M x*--E x *(^) 
n r— f 

i=i 



Pr < sup 

where (Xi)f =1 are independent and distributed according to (j, 



Before presenting the proof, we require an additional lemma which follows 
from Theorem 1.5. Although the first equality is not needed in the sequel, 
it might be useful in other applications. 

Lemma 4.13. For any 1 < p < q < oo there is a constant C Piq for which 
the following holds: if x\, . . . ,x n G B p and T:£ q / — ► is given by Tx* = 
n~ x l 2 Y^l=\ x*(xi)ei, then, for every e > 0, 

log N(e,TB q ,,£2) =log N(e, B q , , L 2 (fi n )) < C M VC(e,B q ,,B p ) ■ log^, 

where [i n is the empirical measure supported on {x\, . . . ,x n }. 

Proof of Theorem 4.12. The first part of the claim follows from 
Corollary 4.3 which yields the upper bound, while the lower one follows 
immediately from Theorem 4.11. 

The second part is evident because, by Lemma 4.13, the class has a con- 
verging entropy integral, which by Theorem 2.12 suffices to ensure that F 
is a uniform Donsker class. 

Finally, the last part follows from the first, combined with Talagrand's 
inequality (Theorem 1.3) and the estimate on the expected deviation in 
terms of the shattering dimension (Theorem 1.6). □ 
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5. The shattering dimension of images of B™. Although the volumetric 
approach yields sharp results in some cases, and, in particular, for T n (B™ , B™) 
for a certain range of p and q, an exact estimate on the factorization constant 
T n (TB™ , B™) does not follow from the volumetric argument, since the posi- 
tion of -B™ is significant, and not only the volume of the ellipsoid of minimal 
volume containing TB™. Indeed, we show that spectral information does not 
suffice for sharp estimates on the shattering dimension. To demonstrate this, 
given a set of (nonnegative) singular values (arranged in a nonincreasing or- 
der) A = (Ai, . . . , A n ), let Ta be the subset of GL n consisting of the matrices 
which have A as singular values. 

Theorem 5.1. For every set A of singular values, 

1 fn 1-1 / 9 , ifq>2, 
sup T n (TB^B-) = -\ q ~ ' 

TgT a A n [ n 1 / , ifq<2, 



and 



To compare this result to the one obtained via the volumetric approach 
(Theorem 4.2), take q = 2, and recall that Theorem 4.2 implies that 

/ n \ 1 / 2n 

r n (TBf,BJ) >cnV 2 ^ JjAr 2 J , 

which, by the means inequality, is weaker than the conclusion of Theo- 
rem 5.1. 

Proof ot Theorem 5.1. By Lemma 3.1, for every T £ GL n , 
r n (TB^B^) = (snp{e\eB^cTB^}y 1 = max \\x\\tb?- 

\\x\\ q =l 

Since (TBf)° = T^ 1 * B^, then for every x, 
\\x\\tb™ = sup (x,y) 

= sup (x,T~ l *y) 

2/6-BS, 

= sup (T~ 1 x,^2e i e i ) 
(ei)? =1 6{-l,l}" \ i=1 I 
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and 



max ||x||:r_B n = sup sup (T 1 x,^2, £ i e i)- 
IMI«=i 1 (ei)2 =1 e{-i,l} n ll*ll,=i\ 



i=i 



By the polar decomposition, T 1 = ODU , where V and O are orthogonal 
and D is the diagonal matrix with eigenvalues A^ -1 . Thus 

inf max IMItb™ 
TeT A ||x|| =l" 1 



inf sup sup ( ODVx, V] 

o,veo„ ( 6l )« =1 e{-i,i}»|W|,=i\ 



i=l 



and 



sup max IpHtb" 
TeT A [|a=ll«=l 



sup sup sup ( ODVx, ^ Sjej ), 

o,veo„ (e i )j =1 e{-i,i} B Nl 8 =i \ 



i=l 



where O n denotes the set of orthogonal matrices on R n . Set (fj,i)f =1 to be 
the eigenvalues of D arranged in a nonincreasing order, that is, fj,\ = X^ 1 > 

Let 



f(0,V) = max max ( ODVx, > e^e; 

M ' ^ (ei)? =1 G{-l,l}»INII,=l\ ^ 



and observe that 



/(O, V) = max max ( x, ^ ( X] ^' fc ft X! efc 



( £ i)Ll Vfc=l 



max 



n / ra 



j=l \i=l I 



Clearly, 



max f(0, V) = max max > 



n in 



max max 



l|2 = V^Vfc 



E 



Eft E ~<°'.> V 3k 
j=l \i=l / 

q'\ V?' 

i=i 



A V?' 
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max max ||xV|| 9 / 

V \\x\\2=Hly/n 



x^O \\x\\2 

from which the first part of the claim follows. 
To prove the second part, note that 



min(/(0,F)) 9 '>minE £ E 



o,v 



k=l 



E E e i°iJ V 0k 
j=l \i=l ) 



mm 

o,v 



k=l 



i=i j=i 



i.i 
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where (ej)f =1 are independent Rademacher random variables. Therefore, by 
Khintchine's inequality, 



n In In 



2\ g'/2 



(*) > min C q J2 E E HYikOij 
fc=i\i=i\j=i / 



Denoting h k = (fijVjk)^, 



2\ 1/2 



\h,0 



k^\2 



and applying Khintchine's inequality again, 

In \ l'/2 



''k 2 



E e jMj^jfc 
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\j=l J 

q'/2 
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By Jensen's inequality and since the matrix (ejVjk)™ '^—i is a l so orthogonal 
for any realization of the Rademacher variables, 

/ n n q'\ W 



min/(0,y)>C 9 min(E £ E 



fc=i 



>C g E £ min(E 



,fe=l 



3=1 



g\ l/q> 



C q mm\\fiV\\ q/ 



and 



min ||/iV|L/ 



\W\2 



1, if 9' < 2, 

„l/3'-l/2 if a '>2. 
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Finally, to see that the lower bound is tight, set O = id, and, thus, 



f (id, V) = max 

(**)£=! 



The sharpness is evident by optimizing with respect to V . □ 
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